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ABSTRACT 

Using data from the 2 degree Field Galaxy Redshift Survey (2dFGRS) we compute 
the conditional luminosity functions (CLFs) of early- and late-type galaxies. These 
functions give the average number of galaxies with luminosities in the range L ± dL/2 
that reside in a halo of mass M, and are a powerful statistical tool to link the distribu- 
tion of galaxies to that of dark matter haloes. Although some amount of degeneracy 
remains, the CLFs are well constrained. They indicate that the average mass-to-light 
ratios of dark matter haloes have a minimum of ~ 100ft- (M/L) around a halo mass 
of ~ 3 x 10 n /i _1 M . Towards lower masses (M/L) increases rapidly, and match- 
ing the faint-end slope of the observed luminosity function (LF) requires that haloes 
with M < 10 10 ft _1 Mq are virtually devoid of galaxies. At the high mass end, the 
observed clustering properties of galaxies require that clusters have 6j-band mass-to- 
light ratios in the range 500 — 1000 /i(M/L)q. Finally, the fact that early-type galaxies 
are more strongly clustered than late-type galaxies requires that the fraction of late- 
type galaxies is a strongly declining function of halo mass. We compute two-point 
correlation functions as function of both luminosity and galaxy type. The agreement 
with observations, in terms of normalization and power-law slope, is remarkably good. 
When including predictions for the correlation functions of faint galaxies we find a 
weak (strong) luminosity dependence for the late (early) type galaxies. We also in- 
vestigate the inferred halo occupation numbers. Late-type and faint galaxies reveal 
a shallower (N)(M) than bright, early-type galaxies, which explains why (N)(M) 
transforms from a single power-law for bright galaxies to a more complicated form 
when fainter galaxies are included. Finally we compare our CLFs with predictions 
from several semi-analytical models for galaxy formation. As long as these models 
accurately fit the 2dFGRS luminosity function the agreement with our predictions is 
remarkably good. This indicates that the technique used here has recovered a sta- 
tistical description of how galaxies populate dark matter haloes which is not only in 
perfect agreement with the data, but which in addition fits nicely within the standard 
framework for galaxy formation. 

Key words: galaxies: formation — galaxies: clusters — large-scale structures: cos- 
mology: theory — dark matter 



1 INTRODUCTION 

According to the current paradigm galaxies form and reside 
inside extended cold dark matter (CDM) haloes. One of the 
ultimate challenges in astrophysics is to obtain a detailed 
understanding of how galaxies with different properties oc- 
cupy haloes of different masses. This link between galaxies 
and dark matter haloes is an imprint of various complicated 



* E-mail: vdbosch@mpa-garching.mpg.de 
© 0000 RAS 



physical process related to galaxy formation such as gas cool- 
ing, star formation, merging, tidal stripping and heating, 
and a variety of feedback processes. 

One method to investigate the galaxy-dark matter con- 
nection is therefore to consider ab initio models for galaxy 
formation, using either numerical simulations (e.g., Katz, 
Weinberg & Hernquist 1996; Fardal et al. 2001; Pearce et 
al. 2000; Kay et al. 2002) and/or "semi-analytical" models 
(e.g., White & Rees 1978; Kauffmann, White & Guiderdoni 
1993; Somerville & Primack 1999; Kauffmann et al. 1999; 
Cole et al. 2000; Benson et al. 2002; van den Bosch 2002). 
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A downside of this approach, however, is that phenomeno- 
logical descriptions have to be used to describe a variety 
of poorly understood physical processes. Consequently, an 
alternative method has been developed which completely 
sidesteps the uncertainties related to how galaxies form. This 
method tries to infer the link between galaxies and dark 
matter haloes directly from the observed clustering proper- 
ties of galaxies. Since haloes of different mass and galaxies 
of different luminosity and type are all clustered differently, 
there is only a limited amount of possibilities by which one 
can distribute galaxies over dark matter haloes such that 
their clustering properties are consistent with observations. 
The backbone of this approach is the so-called halo model, 
which views the evolved, non-linear dark matter distribution 
in terms of its halo building blocks: on strongly non-linear 
scales the dark matter distribution is given by the actual 
density distributions of the virialized haloes, while on larger, 
close-to-linear scales, the dark matter distribution is given 
by the distribution of virialized haloes (see Cooray & Sheth 
2002 for a detailed review). This halo model has become 
more and more accurate due to the fact that detailed ana- 
lytical descriptions for the structure and clustering of dark 
matter haloes have become available (e.g., Navarro, Frcnk 
& White 1997; Moore et al. 1998; Bullock et al. 2001; Mo & 
White 1996, 2002; Power et al. 2002). 

The halo model can be naturally extended to address 
the bias of galaxies by introducing a model for the halo occu- 
pation numbers, (N(M)), which describes how many galax- 
ies on average (with luminosities L > L m i n ) occupy a halo 
of mass M. Numerous studies in the past have used these 
halo occupation number models to investigate how changes 
in (N(M)) impact on several statistical properties of the 
galaxy distribution, such as the real-space two- and three- 
point correlation functions, the power spectrum and bispec- 
trum of galaxies, the galaxy-mass cross correlation func- 
tion, the pair-wise velocity dispersions, etc. (Seljak 2000; 
Scoccimarro et al. 2001; White 2001; Berlind & Weinberg 
2002; Scranton 2002a; Kang et al. 2002; Marinoni & Hud- 
son 2002; Kochanek et al. 2002). In addition, several studies 
have confronted these models with data to put constraints 
on (N(M)) (Jing, Mo & Borner 1998; Peacock & Smith 
2000; Marinoni & Hudson 2002; Kochanek et al. 2002; Jing, 
Borner & Suto 2002; Bullock, Wechsler & Somerville 2002). 
With large galaxy redshift surveys becoming available, such 
as the Two-Degree Field Galaxy Redshift Survey (2dFGRS; 
see Colless et al. 2001) and the Sloan Digital Sky Survey 
(SDSS, see York et al. 2000), these models can now be con- 
fronted with statistical data of unprecedented quality to ob- 
tain stringent constraints on the halo occupation numbers, 
and therewith on both cosmological parameters and galaxy 
formation models. 

In a recent paper, Yang, Mo & van den Bosch (2002; 
hereafter Paper 1) have taken this approach one step fur- 
ther by considering the derivate of (N(M)) with respect to 
Lmin- In particular, they introduced the conditional lumi- 
nosity function (hereafter CLF) §(L\M)dL, which gives the 
average number of galaxies with luminosities in the range 
L±dL/2 that reside in haloes of mass M. The advantage of 
the CLF over the halo occupation function (N(M)) is that 
it allows one to address the clustering properties of galax- 
ies as function of luminosity. In addition, the CLF yields a 
direct link between the halo mass function n(M)dM, which 



gives the number of dark matter haloes per comoving vol- 
ume with masses in the range M ± dM/2, and the galaxy 
luminosity function (hereafter LF) $(L)dL, which gives the 
number of galaxies per comoving volume with luminosities 
in the range L ± dL/2, according to 

/>oo 

$(L) = / $(L\M)n(M)dM. (1) 
Jo 

Therefore, $(L|M) is not only constrained by the clustering 
properties of galaxies, as is the case with (JV(M)}, but also 
by the observed galaxy luminosity function. Furthermore, 
knowledge of $(L|M)dL allows one to compute the average 
total luminosity of galaxies in a halo of mass M, 

/•oo 

(L)(M) = / <fr(L|M) L dL (2) 

JO 

and thus the average mass-to-light ratio as function of 
halo mass. This (M / L)(M) yields important constraints on 
galaxy formation models, as it is a direct measure of the 
halo mass dependence of the galaxy formation efficiency. 

In Paper 1 we focussed on the cosmology dependence 
of the CLF. In this paper, we use data from the 2dFGRS to 
compute the CLFs of both early and late-type galaxies and 
compare our results with several semi-analytical models for 
galaxy formation. It is well known that galaxies of differ- 
ent morphological types have different luminosity functions 
and different clustering properties. For example, early-type 
galaxies have higher characteristic luminosities (e.g., Efs- 
tathiou, Ellis & Peterson 1988; Loveday et al. 1992) and are 
more strongly clustered (e.g., Willmer, Da Costa & Pelle- 
grini 1998; Zehavi et al. 2002) than late-type galaxies. The 
CLFs presented here allow an interpretation of these mor- 
phological dependencies in terms of halo occupation num- 
bers, while our comparison with the semi-analytical models 
links their physical origin to the framework of galaxy forma- 
tion models. 

Throughout this paper we define M to be the halo mass 
inside the radius Riso inside of which the average density is 
180 times the cosmic mean density, and h is the Hubble 
constant in units of 100 kms -1 Mpc -1 . 



2 OBSERVATIONAL CONSTRAINTS 

In paper 1 we used data from the 2dFGRS to constrain the 
CLF of the entire galaxy population. This data set included 
over 110500 galaxies with 17.0 < bj < 19.2 and z < 0.25. 
In this paper we are interested in the CLFs of the early- 
and late-type galaxies, for which we need separate LFs and 
separate measurements of the luminosity dependence of the 
clustering properties. 

Madgwick et al. (2002) used a principal component 
analysis of galaxy spectra taken from the 2dFGRS to ob- 
tain a spectral classification scheme. They introduced the 
parameter n, a linear combination of the two most signif- 
icant principal components, as a galaxy type classification 
measure. As shown by Madgwick et al. (2002), n follows a 
bimodal distribution and can be interpreted as a measure 
for the current star formation rate in each galaxy. Further- 
more n is well correlated with morphological type (Madgwick 
2002). In what follows we adopt the classification suggested 
by Madgwick et al. and classify galaxies with n < —1.4 as 
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Figure 1. The data used to constrain the models. The left panel plots the LFs of the early-type galaxies (starred symbols), the late-type 
galaxies (open triangles), and the combined sample of late- plus early- type galaxies (open circles). For clarity, no errorbars are shown 
(but see Figure 2 below). The dotted vertical lines indicate the luminosity range over which measurement of clustering correlation lengths 
are available. Right panel plots these galaxy-galaxy correlation lengths as function of luminosity (same symbols); Again, for clarity no 
errorbars are shown (but sec Figure 2). 



'early-types' and galaxies with r\ > — 1.4 as 'late-types'. We 
caution the reader that despite the good correlation between 
r) and morphological type, there is not a one-to-one corre- 
spondence between our description of early- and late-type 
galaxies and those obtained using a more morphological cri- 
terion. 



the LFs of the combined and late-type samples are well fit 
by a Schechter (1976) function (see Madgwick et al. (2002) 
for the parameters), (ii) early- (late-) type galaxies domi- 
nate the total LF at the bright (faint) end, and (iii) the LF 
of the early-type galaxies reveals a remarkable, though only 
marginally significant, upturn at the faint-end. 



2.1 Luminosity Functions 

In order to constrain the CLFs of early- and late-type galax- 
ies we adopt the LFs in the photometric bj band computed 
by Madgwick et al. (2002) from the 2dFGRS. This sample is 
restricted to the redshift range z < 0.15 and contains 75589 
unique galaxies with accurate type-classification based on n. 
About 36 percent of these galaxies have n < —1.4 and make 
up what we refer to as the sample of early-type galaxies; the 
remainder makes up the late-type galaxies. In what follows, 
we shall refer to the entire sample (both late- and early-type 
galaxies) as the "combined sample" . 

The LFs obtained from this sample of galaxies have 
been corrected for completeness effects, and a self-consistent 
method for fc-corrections, based on the observed 2dF spec- 
tra, has been applied. A cosmology with flo = 0.3 and 
CIa =0.7 has been adopted, which corresponds to the same 
cosmological parameters as we adopt throughout this paper. 

The left panel of Figure 1 plots the LFs for the com- 
bined sample (circles) , as well as its contributions from late- 
type (triangles) and early-type (starred symbols) galaxies. 
In their paper, Madgwick et al. (2002) subdivided the late- 
type galaxies in three sub-types. The late-type LF used here 
is computed by summing the LFs of these three subtypes, 
and by adding the errors in quadrature. For clarity, no er- 
rorbars have been plotted, but these can be seen in Figure 2 
below. The main characteristics of these LFs are that (i) 



2.2 Correlation Lengths 

It is straightforward to see that there is an infinite set of 
CLFs that, given a halo mass function, can reproduce the 
observed LF $ b s (L) of galaxies. For example, all CLFs 
given by $(L|M) = [$ obs (L)/n(M )]<5 fe (M - M ), where 
8 k (x) is the Kronecker delta function and Mo is an arbi- 
trary mass, yield exactly the same LF, namely <I? bs(L) (see 
equation [1]). Therefore, in order to constrain the CLF, ad- 
ditional constraints are required. Since more massive dark 
matter haloes are more strongly clustered (e.g., Mo & White 
1996), galaxy-galaxy correlation lengths as function of lumi- 
nosity, ro(L), put important constraints on how galaxies of 
different luminosities have to be distributed over haloes of 
different masses. Indeed, as shown in Paper 1, a combina- 
tion of an observed LF with measurements of ro(L) allow 
the CLF to be well constrained. 

Because of the shear size of the 2dFGRS it is possible 
to compute the clustering properties of galaxies of differ- 
ent spectral types and luminosities in representative volume- 
limited samples. Norberg et al. (2002) obtained real space 
galaxy-galaxy two-point correlation functions £ gg (r) from 
the same sample of 2dFGRS galaxies as that used by Madg- 
wick et al. (2002) for the LFs. For a number of volume- 
limited subsamples (specified by a range in absolute magni- 
tude) Norberg et al. computed £gg( r ) (assuming Qo = 0.3 
and = 0.7) separately for the late-type, the early-type, 
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and the combined samples. Each of these correlation func- 
tions is well fit by a simple power-law £ gg (r) = (r/ro) 7 . 
Although there is no clear trend of 7 with either luminos- 
ity or spectral type (see Section 6), the correlation lengths 
ro depend strongly on both luminosity and spectral type. 
This is shown in the right panel of Figure 1, which plots 
ro(L) for the late-type galaxies (triangles), the early-type 
galaxies (starred symbols), and the combined sample (cir- 
cles). For clarity, no errorbars are plotted (but see Figure 2 
below). Two trends are apparent: typically r*o increases with 
luminosity (i.e., more luminous galaxies are more strongly 
clustered) and early-type galaxies are more strongly clus- 
tered than late-type galaxies of the same luminosity. This 
type-dependence is particularly strong for the fainter galax- 
ies. 

Note that the correlation function itself may also evolve 
with redshift, an effect that has not been corrected for. For 
the redshift range covered by the 2dFGRS data used here 
(i.e., z < 0.15), this effect is fairly small, but we nevertheless 
take it into account in our modeling (see Section 3 below). 



3 THEORETICAL BACKGROUND 

Our main goal in this paper is to use the observed LFs and 
correlation lengths to constrain the CLFs of the early-type 
and late-type galaxies. Here we briefly describe how to com- 
pute LFs and correlation functions from a given CLF. More 
details can be found in Paper 1 and references therein. 

To compute the galaxy LF $(L) from the CLF $(L|M) 
(equation [1]) one needs the (cosmology-dependent) mass 
function n(M) of dark matter haloes, which (at 2 = 0) is 
given by 



W M (k) 



n(M)dAf=^/M 



diner 



dlnM 



AM. 



(3) 



Here p is the mean matter density of the Universe at z = 
0, v = 8 c /a(M), 8 C is the critical overdensity required for 
collapse at z = 0, f{v) is a function of v to be specified 
below, and a(M) is the linear rms mass fluctuation on mass 
scale M, which is given by the linear power spectrum of 
density perturbations P(k) as 



\M) 



Wo 



P(k) W 2 M {k) k 2 Ak, 



(4) 



where Wu{k) is the Fourier transform of the smoothing filter 
on mass scale M. 

Throughout we adopt the form of f(u) suggested by 
Sheth, Mo & Tormen (2001): 



(5) 



vJ{y)= 0.644 (1 + -^) (^) 1/2 exp(-^) 

with v 1 — 0.841 v. The resulting mass function has been 
shown to be in excellent agreement with numerical simula- 
tions, as long as halo masses are defined as the masses inside 
a sphere with an average overdensity of about 180 (Sheth & 
Tormen 1999; Jenkins et al. 2001; White 2002). Therefore, 
in what follows we consistently use that definition of halo 
mass, and we use the CDM power spectrum P(k) of Efs- 
tathiou, Bond & White (1992) with a spatial top-hat filter 
for which 



(kRf 



[sin(fc-R) - kRcos(kR)] 



(6) 



where the mass M and filter radius R are related according 
to M = 4npR 3 /3. 

In order to compute ro(L) from the CLF we proceed 
as follows. In the halo model it is natural to consider the 
galaxy-galaxy two-point correlation function, £gg( r )i to be 
build up from two parts; a 1-halo term, £ g g(r), which rep- 
resents the correlation due to pairs of galaxies within the 
same halo, and a 2-halo term, £ g g(»"), describing the contri- 
bution due to pairs of galaxies that occupy different haloes. 
The observed correlation lengths are all well in excess of 
3.5/i -1 Mpc. At radii this large the contribution from the 
1-halo term is negligible and for the purpose of calculating 
correlation lengths only the 2-halo term is required. 

As we show in Appendix A, the 2-halo term of £gg( r ) 
for galaxies with L\ < L < L2 can be written as 

4V) = & 2 dm(r), (7) 

with £dm( r ) the 2-halo term of the dark matter mass corre- 
lation function at z = 0, which at scales of the correlation 
length is well fit by a single power-law 



fdm(r) ^ Cdm(r) 



m,dm 



and 



b = 



r o °° n (M) (N(M)) b(M) AM 
/ o °° n(M) (N(M)} AM 



Here 



(N(M)) 



J L x 



$(L|M) dL 



(8) 



(9) 



(10) 



is the mean number of galaxies in the specified luminosity 
range for haloes of mass M, and b(M) is the bias of dark 
matter haloes of mass M with respect to the dark matter 
mass distribution (see Appendix A). Note that the 2-halo 
term of the galaxy-galaxy correlation function is completely 
specified by the CLF and does not require knowledge about 
how galaxies are distributed inside individual dark matter 
haloes. 

Since 6 is scale-independent, we can rewrite equation (7) 
directly in terms of the correlation length for galaxies as 



T 1.143 

n> = b r ,dn 



(11) 



There is one additional effect, though, that we have 
to take into account. The correlation functions obtained by 
Norberg et al. (2002) have not been corrected for possible 
redshift evolution. This implies that the ro(L) measurements 
do not correspond to z = 0. In fact, since more luminous 
galaxies can be detected out to higher redshift, the correla- 
tion lengths of brighter galaxies correspond to galaxy pop- 
ulations with a higher median redshift. Therefore, in order 
to compare model and observations in a consistent way, we 
have to compute the correlation lengths at the characteris- 
tic redshift of the sample in consideration. We take this into 
account by replacing 6 in equation (11) with b e g(z). Here z 
is the mean redshift of the galaxies in consideration (taken 
from Norberg et al. 2002) and b e g is the effective bias de- 
fined in Appendix B. Since the 2dFGRS sample used here is 
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limited to z < 0.15, this redshift correction is only modest, 
amounting to no more than a few percent change in ro- 



4 MODELING THE CONDITIONAL 
LUMINOSITY FUNCTION 

In Paper 1 we adopted a particular parameterization of 
the CLF for the entire galaxy population (early plus late 
type galaxies), and we used a \ minimization routine to 
find those parameters that best fit the observed and 
ro(L). Here we seek to constrain two independent CLFs, 
namely that of the early-type galaxies, hereafter $ e (L|M), 
and that of the late-type galaxies, hereafter $;(L|M). In 
principle we could use the same parameterizations as in Pa- 
per 1 for both of these CLFs independently. However, there 
is an additional constraint that these CLFs have to fulfill: 
their sum must be equal to the CLF of the entire galaxy 
population, and thus be consistent with the and ro(L) 

of the combined sample. We therefore use a slightly differ- 
ent, two-step method which automatically obeys the con- 
straint that $(L|M) = $i(L|M) + $ e (L|M). We first use 
the same method as in Paper 1 to obtain the CLF $(L|M) 
of the combined sample (step one). Next we introduce the 
function fi (L, M) , which specifies the late-type fraction of 
galaxies with luminosity L in haloes of mass M. The CLFs 
of late- and early-type galaxies are then given by 



$ ; (L|M)dL = f t (L, M) $(L\M)dL 
and, by definition, 

$> e (L\M)dL = [1 - fi(L,M)] $(L|M)dL 



(12) 



(13) 



What remains (step two) is to find the fi{L, M) that, given 
our best-fit CLF for the combined population, best fits the 
LFs and correlation lengths of the early- and late- type galax- 
ies. 

Following Paper 1 we assume that the CLF of the com- 
bined sample can be described by a Schechter function: 



<E>(L|M)dL = j- 



exp(-L/Z*)dL 



(14) 



Here L* = L*(M), a = a(M) and $* = $*(M); i.e., the 
three parameters that describe the conditional LF depend 
on M. In what follows we do not explicitly write this mass 
dependence, but consider it understood that quantities with 
a tilde are functions of M. 

We adopt the same parameterizations of these three pa- 
rameters as in Paper 1, which we repeat here for complete- 
ness. Readers interested in the motivations behind these par- 
ticular choices are referred to Paper 1. For the total mass- 
to-light ratio of a halo of mass M we write 



/ M \ ~ 71 / M 
\~M~J + \W C 



(15) 



which has four free parameters: a characteristic mass M c , 
for which the mass-to-light ratio is equal to (M/L)o, and 
two slopes, 71 and 72, which specify the behavior of (M/L) 
at the low and high mass ends, respectively. A similar pa- 
rameterization is used for the characteristic luminosity U: 



M 



L*(M) 



' M 

M 



\M 2 ) 



(16) 



Here 



/(&) = 



r(Q + i,i)' 



(17) 



with r(x) the Gamma function and T(a, x) the incomplete 
Gamma function. This parameterization has two additional 
free parameters: a characteristic mass M2 and a power-law 
slope 73. For a(M) we adopt: 



a(M) = ais + C log(Mis). 



(18) 



Here Mi 5 is the halo mass in units of 10 15 H~ 1 Mq, ais = 
3 (Mi 5 = 1), and £ describes the change of the faint-end 
slope a with halo mass. Note that once a and L* are given, 
the normalization 3>* of the CLF is obtained through equa- 
tion (15), using the fact that the total (average) luminosity 
in a halo of mass M is given by 



<L)(M) 



f 

Jo 



*(L|M) L dL = 3>* L* r(a + 2). 



(19) 



Finally, we introduce the mass scale M m i n below which the 
CLF is zero; i.e., we assume that no stars form inside haloes 
with M < M m i n . Motivated by reionization considerations 
(see Paper 1 for details) we adopt M m i n = lO 9 /^ 1 M 
throughout. This lower-mass limit does not significantly in- 
fluence our results. For instance, changing M m - ln to either 
10 8 /i -1 Mq or 10 10 /i _1 M has only a very modest impact 
on the results presented below. 

In order to split the CLF in early- and late- type galaxies 
we make the assumption that /; (L, M) has a quasi-separable 
form 



fl {L, M) = g(L) h(M) q(L, M) 
Here 



q(L,M) = 



g(L)h(M) 



if g{L) h(M) < 1 
if g(L)h(M) > 1 



(20) 



(21) 



is to ensure that ft(L,M) < 1. For the g(L) and h(M) 
adopted here (see below), q(L,M) — 1 for the vast ma- 
jority of the relevant (L, M) parameter space, and therefore 
fi(L, M) can be considered separable to good accuracy. This 
basically implies that the shapes (but not the absolute val- 
ues) of the conditionals /;(L|M) and /;(M|L) are indepen- 
dent of M and L, respectively. Note that g(L) is not the 
same as the fraction Ti (V) of all galaxies with luminosity L 
that are late-type, which instead is given by 



ML) _ Jo°° fi(L,M)$(L\M)n(M)dM 



/ °° $(L|M) n(M) dM 



(22) 



Similarly, the fraction of all galaxies in haloes of mass M 
that are late-type is given by 



f,(L,M)$(L\M) dL 
J~t>(L\M)dL 



(23) 



The challenge now is to find the g(L) and h(M) that, 
given the CLF for the combined sample, reproduce the ob- 
served LFs and ro(L) of the early- and late- type galaxies. 
To achieve this we use the following estimate for g(L): 



g(L) 



ML) J ° C HL\M)n(M)dM 
4>(L) $(L\M)h(M)n(M)dM 



(24) 
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where &i(L) and &(L) correspond to the observed LFs of 
the late-type and combined galaxy samples, respectively. 
For any L for which fi(L,M) is separable (i.e., for which 
q{L,M) = 1 for all M), this implies that the LFs of the 
early- and late-type galaxies are given by 



(25) 



i.e., as long as the observed LF of the combined sample is 
well fit by the model, the same will be true for the LFs 
of the early- and late-type galaxies. For the L for which 
q(L, M) < 1 a (typically small) correction to (25) applies. 
What remains is to find the h(M) that best reproduces the 
r (L) of the early- and late- type galaxies. This requires the 
average biases b; and b e for late- and early-type galaxies, re- 
spectively, which are obtained from equations (9) and (10) 
by replacing 3>(L|M) with $ ( (L|M) and <E> e (L|M), respec- 
tively. 

After experimenting with a variety of different func- 
tional forms for h(M) we finally decided to adopt 



h(M) — max j 0, min 



/ log(M/M ) 
I log (Mi /M ) 



(26) 



Here Mo and Mi are two free parameters, defined as the 
masses at which h(M) takes on the values and 1, respec- 
tively. Note that both h(M) and g(L) can in principle take 
on any positive value. However, for computational reasons 
we limit h(M) to take values in the range [0, 1]. This does 
not impact our results, as any linear scaling of h(M) reflects 
itself in g(L) through equation (24). 

Our model for the three CLFs thus contains a total of 
10 free parameters: 4 characteristic masses; M c , Mo, Mi 
and M2, four parameters that describe the various mass- 
dependencies 71, 72, 73 and C, and 2 normalizations; one for 
the mass-to- light ratio, (M/L)o, and one for the faint-end 
slope of the CLF, 015. Although this may seem an awful 
lot, it is important to realize that we use this model to fit 67 
independent data points^. Furthermore, as we show in Sec- 
tion 5 below one can actually use observational constraints 
to fix several of these free parameters. In addition, the data 
is of sufficient quality to constrain the model freedom. Al- 
ternatively, we could have chosen a more restrictive (with 
fewer free parameters) form for the CLFs, but lacking both 
observational and physical motivations for a more preferred 
form of the CLF we felt the need to be sufficiently general. 
On the other side, one might argue that because of this lack, 
our model is actually too constrained. For instance, the fact 
that we assume a Schechter function for the CLF is at best 
weakly motivated by the observed LFs of clusters of galax- 
ies (e.g., Muriel, Valotto & Lambas 1998; Beijersbergen et 
al. 2002; Trentham & Hodgkin 2002; Martinez et al. 2002). 
However, as we show below, our model can accurately fit 
all observations. Therefore, the use of more general mod- 
els would only results in a larger amount of degeneracy and 
will therefore have to await more stringent constraints from 
either the SDSS or the completed 2dFGRS. 



t Because of our definition of g{L) only 67 of the in total 155 
data points are independent. 



5 RESULTS 

The cosmological model sets the halo mass function 
n(M)dM, the dark matter two-point correlation function 
Cdm(r), and the halo bias function b(M). Therefore, differ- 
ent cosmologies imply different halo occupancy functions. In 
this paper we focus on the ACDM cosmology with Qq — 0.3, 
tt A = 0.7, h = 0.7, r = n ft = 0.21, and a 8 = 0.9. 
These cosmological parameters are consistent with recent 
measurements of the cosmic microwave background (e.g., 
Pryke et al. 2001) and large scale structure measurements 
(e.g., Tegmark, Hamilton & Xu 2002). In addition, we have 
shown in Paper 1 that this cosmology is also the most suc- 
cessful in explaining the LF and galaxy clustering luminosity 
dependence observed. Therefore, in what follows, we restrict 
ourselves to this "concordance" cosmology. 

Having specified our model, the observational con- 
straints, and the cosmological parameters, we now proceed 
as follows. We first use Powell's multi-dimensional direction 
set method (e.g., Press et al. 1992) to find the parameters 
of the CLF of the combined sample that minimize 



X = 
Here 



x 2 W x 2 M 

N® N r 



X 



r -,2 

r S(Li)-*(Li) 



and 



2 m = £ 



r {Li) - f (Li) 



Aro(Li) 



(27) 



(28) 



(29) 



with $(!/,) and f (Li) the observed values, and N$ — 44 
and N r = 8 the number of corresponding data points. 

Note that the scaling of \ with N<s> and N r implies 
that we assign equal weights to X 2 (^) an d X 2 ( r o). This dif- 
fers from the proper statistical \ 2 f° r which each individual 
measurement should receive equal weights. However, since 
N<s> > N r and since the errors on 5>(L) are typically much 
smaller than the errors on fo(L), the \ 2 minimization rou- 
tine would give much more weight to fitting the LF than 
to fitting the correlation lengths. Note that the proper \ 2 
is only well defined if there are no systematic errors in the 
measurements and if the error properties of $(L) and fo(L) 
are the same. It is unlikely that these requirements are ful- 
filled, and our modified \ 2 is therefore equally meaningful 
as that of the proper \ 2 ■ 

We start our investigation using all available model free- 
dom, which means that we minimize \ 2 over a total of 10 
free parameters. In the first step we find the 8 parameters of 
the CLF that best fits the LF and correlation lengths of the 
combined sample. In the second step, we then find the Mo 
and Mi, describing h(M), that best reproduce the observed 
clustering properties of the early- and late- type galaxies. We 
refer to this model with maximum freedom as model A. 

The resulting best-fit parameters are listed in Table 1, 
and the fits to the data are shown in the upper panels of Fig- 
ure 2 (solid lines). Overall the fit to the data is remarkably 
good. The fact that the model reproduces even the small 
scale features in the LF of the early-type galaxies is a direct 
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Figure 2. Results for model A. The upper two panels plot the observed LFs (left panel) and correlation lengths (right panel). Symbols 
are the same as in Figure 1, while the solid lines indicate the model results. For clarity, the LFs have been separated from each other by 
one order of magnitude in the y-dircction, with the combined LF untranslated. Similarly, the tq(L) of the early- and late-type galaxies 
are offset by +2h~ Mpc and — 2/i _1 Mpc, respectively. Note the good quality of the fits. The lower panels plot the fraction of late-type 
galaxies as function of luminosity (left) and halo mass (right). 



consequence of the way in which we have defined g(L) (see 
Section 4). 

The lower two panels of Figure 2 plot the number frac- 
tions of late- type galaxies as function of luminosity, !Fi(L), 
and mass, !Fi(M). The fraction of late-type galaxies de- 
creases with both increasing luminosity and mass. Note 
that this is, at least qualitatively, in agreement with the 
well-known morphology-density relation. The 'noisy' wig- 
gles in Fi{L) are due to the fact that g(L) is computed 
directly from the observed LFs, while the feature in !Fi(M) 
for M < 3 x 10 10 /i _1 Mq is a consequence of the upturn in 
the LF of early-type galaxies at L < 10 8 h~ 2 L . 

The upper-right panel of Figure 3 plots the total, av- 



erage mass-to-light ratio of model A as function of halo 
mass, computed using equation (2). The (M / L){M) reveals 
a sharp minimum of ~ HO/i(M/L) at M ~ 2xlO 11 ft- 1 M . 
For M < 2 x lO 11 /). -1 M© the mass-to- light ratios increase 
dramatically with decreasing mass, such that in haloes with 
M < 10 10 /i _1 M Q virtually no (blue) light is produced. This 
sharp increase of (M/L) with decreasing halo mass is re- 
quired in order to bring the steep slope of the halo mass 
function at low M in agreement with the relatively shallow 
faint-end slope of the observed LF. In the language of galaxy 
formation models; low-mass haloes need efficient feedback 
to prevent an overabundance of faint galaxies. For haloes 
with M > 2 x lO 11 ^ -1 M the (average) mass-to-light ratio 
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Figure 3. A comparison of the five models discussed in the text. The upper panel on the left plots the correlation lengths as function of 
luminosity. Symbols with errorbars correspond to the 2dFGRS data with the ro(L) of the late-type and early-type galaxies translated by 
— 2h~ 1 Mpc and +2h~ 1 Mpc, respectively. The fits of these models to the observed LFs are not shown, since the differences between the 
various models can not be discerned by eye. The other three panels plot, for all five models, the inferred mass-to-light ratios (M / L)(M) 
(upper right), the late-type fractions Ti(M) (lower left) and the average total number of galaxies (N tot (M)) (lower right). Model E 
predicts mass-to-light ratios and late-type fractions for cluster-sized haloes that are inconsistent with data. In order to discriminate 
between the other models, better constraints on ro for faint, early-type galaxies and/or accurate, independent measurements of JT ; (M) 
and/or (M/L) for haloes with M > 10 14 /i _1 Mq are required. 



increases roughly as (M/L) oc M , such that clusters of 
10 15 /i -1 Mq have, on average, a (blue) mass-to-light ratio 
of ~ 1000ft (M/L) Q . Within the context of galaxy formation 
models, this increase in mass-to-light ratio is interpreted as 
a decrease of the cooling efficiency in more massive haloes 
(e.g., White & Rees 1978; White & Frenk 1991). 

By construction, (M/L) oc M 72 for haloes with M > 
M c . However, several studies have suggested that on cluster 
mass scales (M/L) varies only weakly with mass (e.g., Bah- 



call, Lubin & Norman 1995; Bahcall et al. 2000; Kochanek et 
al. 2002). Furthermore, Fukugita, Hogan & Peebles (1998), 
using a variety of observational constraints, derived that 
clusters of galaxies have blue mass-to-light ratios of 450 ± 
100/t (M/L)q, i.e., more than 5a lower than what model A 
predicts for a Coma sized cluster. Therefore, we now consider 
a model (model B) in which we keep (M/L) fixed at a con- 
stant value of (M/L) c \ for haloes with M > 10 14 /i _1 M . 
Motivated by Fukugita et al. (1998) we adopt (M/L) c] = 
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Table 1. Model parameters. 



ID 




(M/L) d 


logM c 


logMo 


logMi 


logM 2 


(M/L) 


71 


72 


73 


c 


C*15 


x 2 ($) 


x 2 M 


(1) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


(11) 


(12) 


(13) 


(14) 


(15) 


A 


13.20 




10.85 


16.64 


8.60 


12.15 


136 


2.35 


0.26 


0.73 


-0.19 


-1.09 


65.0 


5.8 


B 


13.03 


500 


10.72 


17.27 


7.92 


12.21 


128 


3.04 


0.27 


0.73 


-0.19 


-1.20 


64.6 


5.2 


C 


13.25 


544 


11.11 


16.71 


8.83 


11.82 


115 


1.36 


0.34 


0.65 


-0.39 


-1.27 


64.7 


5.6 


D 


13.25 


500 


10.94 


17.26 


10.86 


12.04 


124 


2.02 


0.30 


0.72 


-0.22 


-1.10 


64.3 


5.4 


E 


13.50 


924 


12.67 


15.02 


10.11 


11.60 


89 


0.63 


0.99 


0.69 


-0.20 


-0.73 


71.0 


6.3 


Bl 


13.03 


500 


10.72 


17.00 


12.00 


12.21 


128 


3.04 


0.27 


0.73 


-0.19 


-1.20 


64.6 


5.4 


B2 


13.03 


500 


10.72 


15.00 


13.90 


12.21 


128 


3.04 


0.27 


0.73 


-0.19 


-1.20 


64.6 


5.6 



Column (1) lists the ID by which we refer to each model in the text. Columns (2) to (13) list the best-fit model parameters, where 
parameters that were kept fixed during the fitting procedure are type-set in boldface. Here Ml is defined such that L*(Ml) = L* (see 
Section 5), (M/L) c i is the mass-to-light ratio of haloes with M > 10 14 /i _1 Mg, and 015 is the faint-end slope of the conditional LF for 
haloes with M = 10 15 /i~ 1 Mq. Columns (14) and (15) list the values of x 2 (^) an d X 2 ( r o) of the best-fit model, respectively. Here x 2 (^) 
corresponds to the x 2 of the fit to the LF of the combined sample only (N§ = 44), whereas x 2 ( r o) is summed over all ro measurements of 
all three samples (combined, early- and late-type; 7V r = 23). Masses and mass-to-light ratios are in h^ 1 Mq and /i(M/L)q, respectively. 




log[M ] (h-'M ) 

Figure 4. Contour plot of x 2 ( r o) as function of Mo and Mi for 
model B. Contours correspond to X 2 ( r o) = 5.5, 6.5, 7.5, 14.5. 
The location of the best-fit model is indicated by a thick dot 
labeled 'B'. The locations of models Bl and B2, discussed in the 
text, are also indicated. The dashed, diagonal line corresponds 
to Mq = Mi and indicates the boundary above which no good 
fits can be obtained. For models with Mi < Mo there is a large 
area in (Mo, Mi) parameter space that yields virtually identical 

x 2 M- 



500/i (M/L) Q . Continuity of (M/L){M) then sets the pa- 
rameter 72 which is therefore no longer a free parameter. 
With respect to model A, the minimum of {M / L){M) oc- 
curs at a somewhat lower mass of ~ 10 n /i _1 M , the frac- 
tion of late-type galaxies in massive haloes has increased, 
and the fit to the correlation lengths has improved, espe- 
cially for the faint early- type galaxies (see Figure 3). This 
is easy to understand: Lowering the mass-to-light ratios on 
cluster scales means that more galaxies have to reside in clus- 
ters. Since more massive haloes are more strongly clustered 



(Mo & White 1996) and the majority of cluster galaxies are 
early- types, this leads to an increase of ro which is most 
pronounced for the early-types. 

Although overall model B is in better agreement with 
the data than model A, it also has an unattractive feature. 
This is illustrated in the lower right panel of Figure 3 where 
we plot the average, total number of galaxies 

(Ntot(M)) — / $(L|M)di = $*r(Q + l) (30) 
Jo 

as function of halo mass. Model B predicts that this 
number is less than unity for haloes with masses around 
10 11 hT 1 M . Yet, this mass scale coincides with the min- 
imum of (M/L)(M). This implies that these haloes must 
have an extremely large spread in M/L; the majority of 
haloes contain zero galaxies (or only "dark" galaxies, which 
produce no light), while a small fraction harbors one (or 
more) relatively bright galaxy (in order to explain the 
{M/L}). This aspect of the model is due to the fact that 
the CLF for haloes of this mass scale has a relatively high 
characteristic luminosity L* . If we define Ml as the mass 
for which L*{M) = L* , where L* = 1.1 x 10 1() /r 2 L© is the 
characteristic luminosity of the observed LF of the combined 
sample (see Madgwick et al. 2002), we find for model B a rel- 
atively low value of M L ~ 10 13 /i _1 Mq (see Table 1). Since 
the abundance of haloes with this mass is relatively high, 
reproducing the observed abundance of L* galaxies requires 
a significant fraction of haloes with dark or no galaxies. Al- 
though the existence of dark galaxies is interesting in itself 
(see Verde, Oh & Jimenez 2002), a large scatter in M/L for 
haloes with M ~ 10 n /i -1 Mq seems inconsistent with the 
small scatter in the observed Tully-Fisher and Fundamental 
Plane relations (see also discussion in Paper 1). 

Therefore, in models C and E we tune the value of M2 
such that M L = lO 13 ' 25 ^" 1 M and 10 13 ' 5 /i _1 Mq, respec- 
tively (i.e., M2 is not a free parameter here). We again set 
M/L constant for haloes with M > 10 14 /i _1 M Q , but we 
let the value of {M/L) a be a free parameter. These models 
thus have the same number of free parameters as model B. 
As can be seen from Figure 3, model C is remarkably sim- 
ilar to model A. This is mainly due to the fact that the 
value of M L of model A is with in 13 26 close to that of 
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M bj -51ogh log[M] (h->M ) 

Figure 5. Correlation lengths and late-type fractions as function of magnitude and halo mass, respectively, for models B, Bl, and B2. 
Although these three models yield virtually indistinguishable ro(L), they imply Ti{M) that are quite different. Note that model B2 can 
be rejected based on its prediction of zero late-type galaxies in haloes with M > 10 15 fe _1 Mq. 



model C (see Table 1). The best-fit value of {M/L) c \ is 
with 615ft (M/L)q consistent with the value advocated by 
Fukugita et al. (1998) at the 2a level. Model E, however, is 
dramatically different. The minimum of {M /L)(M) occurs 
at a significantly higher mass scale of ~ 2.5 x lO 12 /^ 1 Mq. 
In addition, {M/L) c \ ~ 1000ft (M/L) and Ti = for 
haloes with M >, lO^ft" 1 Mq, both of which are inconsis- 
tent with observations. Furthermore, model E yields a signif- 
icantly poorer fit to the observed correlation lengths of the 
faint early-type galaxies, and we therefore no longer consider 
model E in what follows. 

Finally, in model D, we combine the constraints from 
models B and C above; model D has therefore only 8 
free parameters together with Ml = 10 13 ' 25 /i _1 Mq and 
(M/L) c i = 500ft (M/L) Q . With respect to model C this 
causes a small increase in correlation lengths (again because 
more light is added to the strongly clustered massive haloes), 
though the effect is sufficiently small that model D can still 
be considered as yielding an overall good fit. 

Figure 4 plots contours of constant x 2 ( r o) in the Mo 
versus Mi plane for model B (results for the other models 
discussed above are very similar). The dashed diagonal line 
corresponds to Mo = Mi (i.e., h(M) is a step- function) . 
Clearly, models with Mo < Mi (for which the fraction of 
late-type galaxies increases with halo mass) result in poor 
fits as the value of X 2 ( r o) is always extremely large. This 
is easy to understand. Since the correlation lengths of the 
late-type galaxies are smaller than those of the early-type 
galaxies, and since more massive haloes are more strongly 
clustered, reproducing the observed ro(L) requires that the 
late-type fraction decreases with halo mass, and thus that 
Mo > Mi. However, as is also apparent from the contour 
plot, a large area in the Mo-Mi plane yields roughly equally 
good fits. To illustrate this "degeneracy" we have also com- 
puted two additional models, Bl and B2, for which we keep 
all parameters identical to that of model B, but we modify 



Mo and Mi so that these models fall in the region of roughly 
the same X 2 ( r o) (indicated by solid dots in Figure 4). Note 
that changes in ft(M) only affect the ro(L) of the early- 
and late-type galaxies; all LFs and the correlation lengths 
of the combined sample are unaffected. The results for mod- 
els Bl and B2 are shown in Figure 5. As is also apparent 
from their values of X 2 ( r o) listed in Table 1, these models 
are virtually indistinguishable from model B and from each 
other. However, they do yield somewhat different J~i{M). 
Model B2 for instance yields late-type fractions of zero for 
M > 10 15 ft _1 M . This is inconsistent with data, which al- 
lows us to rule against this particular model. However, mod- 
els Bl and B have quite similar !Fi(M), and current data is 
not sufficient to discriminate between these two alternatives. 
This degeneracy also explains why we have adopted a fairly 
simple form for ft(M); more complicated forms, with more 
free parameters, only increases the amount of degeneracy. 

Having shown that there are different models that fit 
the data roughly equally well, the question arises whether 
one can discriminate between these different halo occupancy 
models. Based on the observed mass-to-light ratios of clus- 
ters, and on the amount of scatter inferred from the Tully- 
Fisher and Fundamental Plane relations, one might argue 
that of the models presented above, model C is to be pre- 
ferred (modulo some uncertainties in Mo and Mi). In what 
follows, however, we always compare predictions from dif- 
ferent models. This gives an idea about the extent of the 
uncertainties in the models, and the accuracy in the data re- 
quired to allow to discriminate between the various models. 
Some clues are already apparent from Figure 3. As we have 
shown above, the models are extremely sensitive to the ex- 
act value of Ml and most of the uncertainties in the models 
can be translated to values of Ml in the range lO 13 /),- 1 M 
to 10 13,5 ft _1 Mq. In order to further constrain this crucial 
parameter more accurate measurements are required of the 
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mass-to-light ratios and late-type fractions in clusters or of 
the correlation lengths of galaxies with Mbj — 51og/i > — 18. 

In summary, within the concordance cosmology sev- 
eral models can be found that fit the data remarkably 
well. Although some amount of degeneracy exists, the mean 
trends reveal an average mass-to-light which has a mini- 
mum of ~ 100 h (M/L) at around 3 x to 11 /! -1 M Q , and 
which increases extremely rapidly towards lower masses: in 
all cases we find (M/L) > 3000 ft (M/L) for haloes with 
M < 10 1() ft _1 M.q. The correlation lengths require that the 
fraction of late-type galaxies decreases from about 90 per- 
cent for haloes with M < 10 12 /i _1 Mq to anywhere between 
and 40 percent on scales of M — 10 15 Ii~ 1 Mq, in qualitative 
agreement with the morphology-density relation. Similarly, 
the observed LFs indicate a similar trend with luminosity: 
about 90 percent of all galaxies with L < 10 9 /i~ 2 Lq are late- 
types, while this drops to about 20-30 percent for galaxies 
brighter than L* . 



6 TWO POINT CORRELATION FUNCTIONS 

Two point correlation functions are a powerful tool to de- 
scribe the clustering properties of galaxies. With the CLFs 
for early- and late-type galaxies we can compute the galaxy- 
galaxy correlation function, £ gg (r), not only as function of 
galaxy type, but also as function of luminosity. This allows 
a detailed investigation of the bias of galaxies as function of 
scale, luminosity, and (spectral) type. 

At small scales, where the number of pairs are mostly 
due to galaxies within the same halo, £ gg (r) depends not 
only on the occupation numbers of galaxies (which can be 
obtained from the CLF), but also on how galaxies are dis- 
tributed inside individual dark matter haloes. In addition, 
since the number of pairs within a halo containing TV galax- 
ies is equal to |TV(TV — 1) one also needs to know the second 
moment of the halo occupation number distribution. Since 
the CLF only contains information about the first moments 
(N(M)} we need to make additional assumptions. Alterna- 
tively, as shown by Cooray (2002), one may use the ob- 
served power spectrum of galaxies to obtain constraints on 
the second moments of P(N\M) using standard inversion 
techniques. 

We follow Yang et al. (2002) and make the assumption 
that the probability distribution P{N\M), with TV an inte- 
ger, is given by 

r TV' + 1 - (TV(M)) if TV = TV' 
P(N\M) = l {N(M))-N' if TV = TV' + 1 (31) 
[ otherwise 

Here TV' is the integer of (N(M)}. Thus, the actual, integer 
number of galaxies in a halo of mass M is either TV' or A' + 1. 
This particular model for the distribution of halo occupation 
numbers is supported by the semi-analytical models of Ben- 
son et al. (2000) who found that the halo occupation prob- 
ability distribution is narrower than a Poisson distribution. 
In addition, they have shown that distribution (31), which 
they call the "average" distribution, is successful in yielding 
power-law correlation functions, much more so than for ex- 
ample a Poisson distribution (see also Berlind & Weinberg 
2002). For this "average" distribution the average number of 



galaxy pairs inside an individual dark matter halo is given 
by 

<TV pair (M)> = TV' <TV(M)) - iiV' (TV' + 1) (32) 

For the spatial distribution of galaxies within a halo 
we follow Peacock & Smith (2000), Benson et al. (2000) and 
Berlind & Weinberg (2002) and assume that (i) the brightest 
galaxy resides in the center, and (ii) the remaining 'satellite' 
galaxies follow the density distribution of the dark matter 
(which we specify in Appendix A). The special treatment of 
a central galaxy is required if the galaxy-galaxy correlation 
function is to remain close to a single power-law, rather than 
to reveal a flattening as present in the dark matter mass cor- 
relation function (Peacock & Smith 2000; Berlind & Wein- 
berg 2002). Pairs between the central galaxy and satellite 
galaxies follow a separation function / cs (r) = 47r/5(r) r 2 , with 
p(r) the normalized density distribution of the halo, which 
integrates to a total halo mass of unity. Satellite-satellite 
pairs follow a different separation function / ss (r) which can 
also be obtained from the halo density distribution. The sep- 
aration function for all pairs, f(r), can be written as 

<TV pair (M))/(r)dr = (TV CS (M)) f cs (r) dr + 

{N BB (M)}f BB (r)dr (33) 

where (N CB (M)) and (N BB (M)) correspond to the average 
number of central-satellite and satellite-satellite pairs, re- 
spectively. 

Once the CLF and (TV pa i r (M)}/(r) are specified, £ gg (r) 
can be computed (see Appendix A). Figure 6 plots £ gg (r) for 
the late- type galaxies (dotted lines), the early- type galaxies 
(dashed lines) and the combined sample of galaxies (solid 
lines) for different luminosity bins and models. In addition, 
we plot the dark matter mass correlation function (dot- 
dashed lines) for comparison. Panels in the upper three rows 
show results for galaxies in magnitude intervals for which 
data from the 2dFGRS exists (Norberg et al. 2002). The 
panel in the lower row corresponds to — 14 > M b j — 51ogft > 
— 18. Since Norberg et al. only presented correlation func- 
tions for galaxies with Mb, — 51og/i < —17.5, no data on 
£ gg (r) exists for this magnitude range. 

Virtually all correlation functions, including that of the 
dark matter, reveal a special feature in the radial interval 
l/i -1 Mpc < r < 3/i _1 Mpc. This coincides with the tran- 
sition from the 1-halo term to the 2-halo term (see Ap- 
pendix A): at r < lh^ 1 Mpc the correlation functions are 
governed by galaxy pairs within individual dark matter 
haloes and therefore depend on our assumptions regarding 
(TV pa i r (M))/(r). For r >, 3fe _1 Mpc, however, the galaxy cor- 
relation functions are independent of how galaxies are dis- 
tributed inside haloes and only depend on the first moments 
(TV(M)} of the halo occupation number distributions. 

Typically, at r <, 2/jT 1 Mpc the ratio of the correlation 
functions of the early- and late- type galaxies is larger than at 
r >, 2h~ Mpc. Since at small scales the clustering strength 
depends on the second moment of the occupancy numbers, 
the correlation function at small r is more dominated by 
the contribution from massive haloes than at large r. The 
ratio between the £ gg (r) of early- and late-type galaxies at 
small r is therefore sensitive to !Fi(M) at large M. This 
is immediately apparent from a comparison with the lower 
left panel of Figure 3, which shows that models A and C are 



© 0000 RAS, MNRAS 000, 000-000 



12 van den Bosch, Yang & Mo 

Model A Model B 



3 
2 
1 

-1 

3 
2 
1 

■1 

3 
2 
1 

-1 

3 
2 
1 

-1 





" I " " I " 



1 1 1 1 1 '+' 1 1 1 1 1 1 1 1 1 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : : l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : : l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : : l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



■19.0 > M' > -20.0 ± 



-14.0 > M' bj > -18.0 



I i i i i I i i i i I i i i i I i i + i i i i I i i i i I 



1 1 I 1 '+' 1 1 1 I 1 1 



Model C 



1 1 1 1 1 1 1 1 1 '+' 1 1 1 1 



Model D 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 m i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




: i ll 1 1 II 1 1 II III 1 1 ll 1 1 1 ll : : l I ll 1 1 1 ll 1 1 1 ll 1 1 1 ll 1 1 1 : : l 1 1 III ll I II 1 1 1 II 1 1 1 II 




1 1 i+ i 1 1 1 1 



" I " " I " 



1 1 1 1 1 1 1 i + i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



-1 -0.5 0.5 1 -1 -0.5 0.5 1 -1 -0.5 0.5 1 -1 -0.5 0.5 1 

log[r] (h-'Mpc) 

Figure 6. Real-space correlation functions of all galaxies (solid lines), late-type galaxies (dotted lines), and early- type galaxies (dashed 
lines). Results are shown for four different models, and for four different magnitude bins (as indicated). Note that M' b is defined as 
Mf, — 51og/i. The dot-dashed curves in each panel correspond to the real-space correlation function of the evolved, non-linear dark matter 
mass distribution, and is plotted for comparison. Note that the ratio between the correlation functions of early- and late-type galaxies 
increases with decreasing r. See text for a detailed discussion. 



very similar with lower late-type fractions than models B 
and D. Consequently, the ratio of £ gg (j") of early- and late- 
type galaxies at small r is larger for models A and C than 
for models B and D. 

As is also immediately apparent from Figure 6, the bias 
of galaxies with respect to the dark matter mass distribution 
depends sensitively on galaxy luminosity, on the radial scale, 
and on galaxy type (see also Kauffmann, Nusser & Steinmetz 
1997). In general, early-type galaxies are biased tracers of 
the mass distribution at all scales, and for all luminosities, 
whereas late-type galaxies are virtually always anti-biased. 



The combined sample of early- and late-type galaxies reveals 
a transition from positive bias at large scales (r >, 2ft~ 1 Mpc) 
to anti-bias at smaller scales, except for the most luminous 
galaxies which are always positively biased. 

6.1 Comparison with observations 

Can we compare these £gg( r ) to those obtained from the 
2dFGRS? Norberg et al. (2002) argued that the real-space 
correlation functions of all subsamples of the 2dFGRS data 
are well fit by a single power-law (independent of luminos- 
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Figure 7. Same as Figure 6, except that here we plot the projected correlation function w p (r p )/r p instead of the real-space correlation 
function £(r ). Note that these projected correlation functions, which unlike the real-space correlation functions can be directly obtained 
from redshift surveys, are much smoother than the real-space correlation functions and much better resemble single power-laws. 



ity or spectral type). Taken at face value, the fact that 
virtually all correlation functions shown in Figure 6 reveal 
clear deviations from pure power-laws would force us to rule 
against each of our models. However, it is important to take 
into account how real-space correlation functions are ob- 
tained from data. After all, the data only yields a redshift- 
space correlation function, which is distorted with respect 
to the real-space correlation function due to the peculiar 
motions of the galaxies. On small scales the virialized mo- 
tion of galaxies within dark matter haloes cause a reduction 
of the correlation power (the so-called "finger-of-God" ef- 
fect), while on larger scales the correlations are boosted due 
to coherent flows (Kaiser 1987). It is common practice to 



therefore compute the galaxy correlation function on a two- 
dimensional grid of pair separations parallel (n) and perpen- 
dicular (r p ) to the line-of-sight. Integration of this £gg( r p> tt) 
over it then yields what is called the projected correlation 
function, which is related to the real-space correlation func- 
tion by an Abel transform 



w p (r p ) 



f 

J — ; 



£gg(rV,7r)d7r = 2 / £ gg (r 



r dr 



(34) 



(Davis & Peebles 1983). If the real-space correlation func- 
tion is a single power-law, £ gg (r) = (r/ro) 7 then the pro- 
jected correlation function also follows a single power-law 
w p (r p )/r p = A(-y)(r p /r y with 
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Figure 8. The power-law slopes 7 and zero-points wo of the projected correlation functions of galaxies within a one magnitude luminosity 
bin, obtained from a single power-law fit over the range —0.5 < log(r p ) < 1.3. Symbols with errorbars correspond to the 2dFGRS data 
of Norberg et al. (2002), while the various lines indicate predictions from our models, as indicated. The overall agreement with the data 
is reasonable. Except for the relatively faint early-type galaxies, differences between the various models are small. Further constraints 
will have to await larger redshift surveys (i.e., smaller errorbars). 



A( 7 ) 



r(i/2)r(-( 7 + i)/2) 

IX-7/2) 



(35) 



(see e.g., Baugh 1996). 

In Figure 7 we plot w p (r p )/r p (obtained using equa- 
tion [34]) for the same samples of galaxies as in Figure 6. 
As can be seen, the projection has largely washed-out the 
features at ~ 2h~ 1 Mpc in the real-space correlation func- 
tions, and overall the projected correlation functions better 
resemble single power-laws. Only the correlation functions 
of early-type galaxies with — 14 > M(, ; — 51ogft > —18, for 
which no data exists to date, clearly deviates from a single 
power- law. 

In order to compare our models to the 2dFGRS data 
in a more quantitative way, we proceed as follows. For each 
spectral type and each luminosity bin Norberg et al. (2002) 
have fit a single power-law relation of the form w p (r p )/r p = 
A(7)(r p /r ) 7 to the projected correlation functions obtained 
from the 2dFGRS data. We rewrite this as 



log[w p (r p )/r p ] =w + 7log(r p ) 



(36) 



and compute wo = log A (7) — 7logro and its error from 
the values of r and 7 (and their errors) quoted by Nor- 
berg et al. (2002). The results are shown in Figure 8 (open 
circles with errorbars). Next, for each of our models we fit 



equation (36) to the projected correlation functions over the 
same radial interval over which Norberg et al. (2002) fitted 
the data (-0.5 < log(r p ) < 1.3). 

Our best-fit wo and 7 are shown in Figure 8 for all mod- 
els listed in Table 1. Overall, the models fit the data reason- 
ably well. Although the models predict somewhat more pro- 
nounced dependencies of 7 and wo on luminosity, and some- 
what shallower correlation functions for the late-type galax- 
ies, the errorbars on the data are too large to rule against any 
of these models. The level of agreement is actually quiet sur- 
prising in light of the oversimplified assumptions regarding 
(iVpa,i r (M)) and f(r). In particular, the assumption of sub- 
Poissonian occupation distributions may be a poor assump- 
tion on the scale of clusters (Benson et al. 2000; Scoccimarro 
et al. 2001, Cooray 2002). In addition, we have assumed that 
f(r) is independent of galaxy type and or luminosity (except 
for the brightest galaxy, which is always assumed to reside 
at the center). However, it is well known that clusters of 
galaxies reveal a radial morphology segregation (Abraham 
et al. 1996; van Dokkum et al. 1998; Balogh et al. 1999), 
with early-type (late-type) galaxies more confined towards 
the center (outer regions). Although morphological segrega- 
tion within individual haloes is easy to take into account 
(e.g., Scranton 2002a,b), such a detailed investigation is be- 
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Figure 9. The power-law slopes 7 and zero-points u>o of the projected cross-correlation functions with bright (—21 > M^. — 51ogft > —22) 
early-type galaxies. As in Figure 8 7 and wo are obtained from a single power-law fit over the range —0.5 < log(r p ) < 1.3. 



yond the scope of this paper. However, we intend to return 
to this issue in a future paper. 

Figure 8 also shows predictions for 7 and wo for galax- 
ies down to Mb j — 51ogft = —15.5. This shows that the 
main difference between the various models occurs for faint 
early- type galaxies. Furthermore, whereas the models pre- 
dict very little luminosity dependence in the projected cor- 
relation function of late-type galaxies (except for a modest 
increase of the normalization with luminosity), for the early- 
type galaxies a very pronounced luminosity dependence is 
predicted. In particular, the models predict that early-type 
galaxies with Mb, — 51og/i = —19.5 (i.e., roughly L*), are 
less strongly clustered and with a shallower power-law slope 
than both brighter and fainter early-type galaxies (except for 
model E). Future data from the completed 2dFGRS and/or 
the SDSS will prove very useful in further testing and con- 
straining the CLF models presented here. 

In summary, once assumptions are made regarding the 
second moments of the halo occupation numbers and the 
spatial distribution of galaxies within individual dark mat- 
ter haloes, the two-point correlation functions of galaxies in 
any magnitude interval can be computed from the CLFs. 
Because the fraction of late-type galaxies decreases with in- 
creasing halo mass, late-type galaxies are less strongly clus- 
tered than their early-type counterparts at small separa- 



tions. This results in steeper projected correlation functions 
for the early-type galaxies, in qualitative agreement with the 
data. The remaining discrepancies, although small, are most 
likely a reflection of the fact that we have not taken the spa- 
tial segregation of morphological types in clusters into ac- 
count. Finally we stress that accurate correlation functions 
of faint early-type galaxies are required in order to break 
the degeneracy present in the current models for the CLF. 



6.2 Cross correlation functions 

In addition to the (auto) correlation functions presented 
above, the CLFs also allow the computation of various cross 
correlation functions. As an example, Figure 9 plots the 
zero-point wo and power-law slope 7 of the projected cross- 
correlations with bright (—21 > M bj — 51og/i > —22) early- 
type galaxies. Compared to the auto-correlation functions 
(Figure 8) the cross correlation functions are steeper and 
with a higher zero-point wo- This is due to the fact that 
bright early-type galaxies reside predominantly in massive 
clusters. In addition, compared to the auto-correlation func- 
tions, the late-type galaxies reveal a more pronounced de- 
pendence of 7 on luminosity. This is due to the fact that 
clusters are predominantly occupied by early-type galaxies: 
Faint late-type galaxies reside mainly in low-mass haloes, 
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Figure 10. Halo occupation numbers for the same magnitude bins and models as in Figures 6 and 7. Dotted and dashed lines correspond 
to the late- and early-type galaxies, respectively. See text for a detailed discussion. 



and are therefore relatively weakly correlated with bright 
early-type galaxies. The brightest late-type galaxies, on the 
other hand, are still predominantly found in massive clus- 
ters, and therefore strongly correlated with the bright early- 
types. Note that these predictions are likely to depend fairly 
strongly on the presence of a radial morphology segregation 
in clusters, which is not taken into account here. Therefore, 
Figure 9 should not be taken as a strong prediction, but 
merely serves to illustrate the power of the CLF method for 
computing various statistical properties of the galaxy distri- 
bution. 



7 HALO OCCUPATION NUMBERS 

We now use the CLFs to compute halo occupation num- 
bers using equation (10). Results are shown in Figure 10 
for the same samples of galaxies as in Figures 6 and 7. Al- 
though there are small differences amongst the four models 
presented here, several trends are shared by all models. First 
of all, one finds that the derivative d{N)/dM is larger when 
brighter galaxies are considered. Secondly, when sufficiently 
faint galaxies are included, d{N) /AM depends more strongly 
on mass (i.e., (N(M)) deviates more from a single power- 
law) in the sense that it decreases towards lower M. We can 
understand this behavior by focusing on the contributions 
of the late- and early-type galaxies. In all cases we find that 
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Figure 11. A comparison of the halo occupation numbers ob- 
tained by Peacock & Smith (2000, solid dots) and our CLFs for 
models A, B, C, and D (various lines). The open circles corre- 
spond to the halo occupation numbers obtained after replacing 
N with TV 0,92 as suggested by Peacock & Smith for haloes at 
the high mass end. The two dotted, horizontal lines indicate the 
range in N over which Peacock & Smith were able to constrain 
their models directly from data. 

d{N)/dM is largest for the early-type galaxies. At large M 
and L, the majority of galaxies are early-types, and (N(M)) 
is well fit by a single power-law. At lower L the fraction of 
late-type galaxies increases, predominantly at lower masses, 
such that a transition occurs from being dominated by late- 
type galaxies (at low M) to being dominated by early-type 
galaxies (at high M). Finally, note that (N(M)) is always 
truncated at low M, indicating the presence of a minimum 
halo mass that can harbor a galaxy of a given luminosity. In 
terms of our models, this truncation is a reflection of L*(M). 

7.1 Comparison with previous work 

Several studies in the past have used a variety of methods 
to constrain (N(M)} from observations. Jing, Mo & Borner 
(1998), using the two-point correlation function and pairwise 
velocity dispersions from the Las Campanas Redshift Survey 
(LCRS; Shectman et al. 1996), obtained N oc M 92 . Scoc- 
cimarro et al. (2001) fitted the measurements of counts-in- 
cells from the APM survey (Maddox et al. 1990) deprojected 
into three dimensions by Gaztanaga (1994), and found that 
N oc M°' s is required in the high mass limit. 

Unfortunately, a direct comparison of these results with 
each other or with predictions from our CLFs is complicated 
for several reasons. First of all, the results obtained by Jing 
et al. (1998) and Scoccimarro et al. (2001) are based on 
apparent magnitude limited samples, which are difficult to 
compare to halo occupation numbers in absolute magnitude 
ranges, such as those obtained from the CLFs. Secondly, the 
LCRS is based on i?-band data for which halo-occupation 



numbers are not expected to be the same as in the ftj-band 
used here and by Scoccimarro et al. In fact, as shown by 
Jing et al. the two-point correlation function of the LCRS is 
significantly steeper than that of the APM survey on which 
the 2dFGRS data is based. 

Peacock & Smith (2000; hereafter PS00), on the other 
hand, presented halo occupation numbers for galaxies with 
Mb j < —19.0, which we can compare directly to predic- 
tions from our CLFs. Using data on galaxy groups from 
the CfA survey (Ramella, Pisani & Geller 1997) and the 
ESO Slice Project (Ramella et al. 1999), PS00 found that 
the number density of groups containing N galaxies obeys 
p{N) oc N~ 2 ' 7 . Equating the number density of groups with 
multiplicity larger than N to the number density of dark 
matter haloes with mass larger than M, results in the halo 
occupation numbers shown by solid dots in Figure 11. For 
comparison, we also show the predictions for our CLF mod- 
els A, B, C, and D (various lines). Over the intermediate 
mass range 2 x 10 13 fe _1 M Q < M < 5 x lO 14 ^ 1 M the 
PS00 estimates agree well with the occupation numbers ob- 
tained here. However, at both lower and higher masses the 
(N(M)) of PS00 are significantly higher than in any of our 
four models. The data used by PS00 was restricted to groups 
with 3 < N < 20 (indicated by dotted, horizontal lines). For 
N > 20 PS00 simply extrapolated the power-law behavior 
of p(N) to higher multiplicity. As PS00 argued themselves, 
these results may not be completely reliable, and PS00 there- 
fore suggested to replace N with A ' 92 at the high mass end. 
This is indicated by the open circles in Figure 11 and brings 
the PS00 results in good agreement with the (N(M)} of our 
models B, C, and D for haloes with M > 2 x 10 13 /i _1 M . 
For N < 3 the disagreement remains. However, here PS00 
extrapolated the power-law behavior of p(N) down to N — 2 
and obtained the result for N — 1 by matching the number 
density of galaxies to the observed value. Given the uncer- 
tainties involved, it is encouraging that these two results, 
based on completely different methods, agree as well as they 
do. 



8 COMPARISON WITH SEMI-ANALYTICAL 
MODELS 

In this paper we have derived conditional luminosity func- 
tions for late- and early-type galaxies. We have shown that 
although some degeneracy remains, the overall behavior of 
(M/L) and (N(M)} is well constrained. We thus have ob- 
tained important constraints on how galaxies have to be 
distributed over dark matter haloes in order to yield lumi- 
nosity functions and correlation lengths in agreement with 
the data. The important question that arises is whether such 
halo occupancy distributions can be obtained within the 
standard framework for galaxy formation; i.e., how do the 
cooling, feedback and star formation efficiencies depend on 
halo mass, such that one can reproduce the CLFs obtained 
here. 

In order to address this question we compare our results 
with several semi-analytical (SA) models in the literature. 
These SA models use various phenomenological prescrip- 
tions to describe the star formation, feedback, and cooling 
efficiencies which are linked to merger histories of dark mat- 
ter haloes obtained from either the extended Press-Schechter 
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Figure 12. Comparison of the semi-analytical models of Kauffmann et al. (1999; K99) with models A, B, C, and D. Upper left panel 
plots the halo occupation numbers for galaxies with M b — 51og/i < —19.5. Symbols correspond to the semi-analytical models, and lines 
to the CLF models, as indicated. Compared to our models, the SA-model underpredicts the average number of galaxies in massive haloes. 
The upper right panel plots the corresponding ratio of halo mass to the total luminosity Lig.5 contributed by galaxies brighter than 
Mbj — 51og/i = —19.5. The K99 model predicts higher (lower) values of M/L19.5 compared to the CLFs presented here at the high (low) 
mass end. The lower right panel plots the mass distribution of haloes hosting galaxies with M b] — 51ogft < —19.5. Again, the agreement 
between the K99 model and our CLF models is poor, with the latter predicting much broader halo mass distributions. The reason for 
these various discrepancies is that unlike for the CLF models presented here, the LF of the K99 model is strongly inconsistent with the 
observed LF. This is indicated in the lower left panel where symbols correspond to the 2dFGRS LF of Madgwick et al. (2002) while the 
histogram corresponds to the LF of the model galaxies of K99. The vertical dotted line indicates M;,, — 51og/i = —19.5. 



formalism (e.g., Lacey & Cole 1993), or directly from numer- 
ical simulations. Combined with stellar population models 
and recipes for the merging of galaxies within merging dark 
matter haloes these models yield, amongst others, luminosi- 
ties of galaxies in haloes of different masses. 

Here we compare our models with the SA models of 
Kauffmann et al. (1999; hereafter K99), Mathis et al. (2002; 



hereafter M02), and Benson et al. (2002; B02)t. Each of 
these three models used exactly the same cosmological pa- 



t The models of K99 and M02 are publicly available at 
http://www.MPA-Garching.MPG.DE/~virgo/virgo/index- 
galaxy.html while the model of B02 was kindly provided to us 
by Andrew Benson. 
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Figure 13. Same as Figure 12, except that here we compare our models to the SA model of Mathis et al. (2002; M02) for galaxies with 
Mj,j — 51ogfe < —18.5. Since the LF of this SA model is in somewhat better agreement with the observations (see lower left panel), the 
overall agreement with our CLF models has also improved, though only marginally. 



rameters as adopted here. K99 and M02 used numerical sim- 
ulations to compute the merger histories of the dark matter 
haloes, and their models are consequently limited in halo 
mass by numerical resolution. The model of B02, on the 
other hand, is based on merger histories obtained using the 
extended Press-Schechter formalism (for details see Cole et 
al. 2000) , and therefore allows a comparison with our models 
over a much larger range of halo masses and luminosities. 
The results of K99 and M02 are based on the same semi- 
analytical model, although the parameters used are some- 
what different. Luminosities in the K99 model have not been 
corrected for dust extinction, whereas those in M02 have 
(though we have verified that this has only a very small 
effect on the results presented below). Both K99 and M02 



tuned their model parameters to fit primarily the zero-point 
of the Tully-Fisher (TF) relation. This results in rather poor 
fits to the observed LF (see below). The model of B02 is sig- 
nificantly different. It is based on the semi-analytical models 
of Cole et al. (2000) but with (i) an improved prescription 
for the effects of tidal stripping and dynamical friction, and 
(ii) a model for feedback from the photoionizing background. 
Unlike in K99 and M02, B02 tune their parameters to re- 
produce the 2dFGRS LF of Madgwick et al. (2002), which 
results in a TF zero-point that is offset from the observed 
value (see discussion below). 

Before we can make a fair comparison we need to adopt 
a consistent definition of halo masses. Both the SA models 
of K99 and M02 provide two measures for the halo mass 
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Figure 14. Same as in Figures 12 and 13 but for the semi-analytical model of Benson ct al. (2002; B02) for galaxies with M^j — 51og/i < 
— 16.5. Contrary to those of K99 and M02, this model agrees well with both our CLF models and with the observed 2dFGRS LF. Since B02 
tuned their model parameters to reproduce this particular LF, the latter does not come as a surprise. However, the good agreement for 
the halo occupation numbers, mass-to-light ratios, and halo mass distributions with our CLF models is remarkable given the completely 
different nature of both techniques. The remaining disparities between the SA model and our CLF models can be contributed to the 
remaining disagreement between the observed LF and that of the B02 model. 



associated with each galaxy; Mpof, the mass of the halo 
obtained using the standard friends-of-friends (FOF) algo- 
rithm (Davis et al. 1985) with a linking length of b — 0.2 
times the mean inter-particle separation, and M200 , the mass 
of a sphere centered on the most-bound-particle of the FOF 
group with a radius inside of which the average density is 
200 times the critical density. For each individual halo Mpof 
and M200 can differ substantially, and therefore the results 
are sensitive to which halo mass we adopt. Since our mod- 
els rely on the Sheth & Tormen (1999) halo mass function, 
and since Jenkins et al. (2001) have shown that this is in 



very good agreement with the mass function of FOF groups 
with a linking length of b — 0.2, we use Mpof- B02 define 
their halos as spheres with an average overdensity of 320. 
We convert these masses to Miso, consistent with the def- 
inition of halo mass used throughout this paper, assuming 
that dark matter halos follow the NFW density distribu- 
tion (Navarro, Frcnk & White 1992) with a concentration 
parameter given by Eke, Navarro & Steinmetz (2001). Next 
we need to be careful about the use of luminosities. The SA 
models give for each galaxy the absolute magnitudes in the 
B and V bands. In order to allow a fair comparison with 
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our models, we compute for each galaxy Mbj — 51og/t using 
bj = B- 0.28(5 - V) (Blair & Gilmore 1982) and adopting 
the Hubble constant used in the models (h = 0.7). 

We first compare our results with the model of K99. 
Because of the numerical resolution of the simulation used 
by K99, only galaxies with a stellar mass M» > 1.4 x 
10 10 /i _1 M are available. Although this includes galaxies 
down to Mb j — 51ogft ~ —18, the sample of model galax- 
ies is only complete down to Mb , — 51og/i = —19.5. When 
comparing for instance halo occupation numbers, complete- 
ness is important and we therefore limit ourselves here to 
a comparison of galaxies with M bj — 51og/i < —19.5. The 
upper left panel of Figure 12 plots the number of galax- 
ies with Mb , — 51ogft < —19.5 as function of halo mass. 
Open circles correspond to the K99 model, with errorbars 
indicating the rms scatter. In addition, we show the pre- 
dictions for models A, B, C, and D (various lines). Differ- 
ences between these four models are typically small. For 
M > 10 13 /i _1 M the SA model predicts fewer galaxies 
per halo than our CLF models, while the opposite is true 
for haloes with M ~ 5 x 10 12 /i _1 M . The upper panel 
on the right plots (M/L19.5) as function of halo mass. 
Here -L19.5 is defined as the total luminosity of galaxies 
with Mb r — 51og/i < —19.5 in each individual halo. For 
M > 10 14 /i _1 M the SA model overpredicts (M/L19.5) 
compared to our CLF models, in agreement with the fact 
that K99 predicts lower occupation numbers. However, at 
M ~ 10 13 /i _1 M , where both the SA and CLF models 
predict on average one galaxy per halo, the former predicts 
significantly lower (M/L19.5). This indicates that the aver- 
age luminosity of galaxies in haloes with M = lO 13 /i _1 M is 
higher in the SA model than in our CLF models. Finally, the 
lower right panel plots the mass distribution of haloes har- 
boring galaxies with Mb, — 51og/i < —19.5. Note that a halo 
containing N such galaxies is counted N times. The hatched 
histogram corresponds to the SA model, whereas the various 
lines show predictions for our CLF models computed from 



P{M)AM : 



J L °° $(L|M)dL 



n(M) AM. 



(37) 



with L m in the lower luminosity limit of the sample. Again 
the agreement between the SA and CLF models is extremely 
poor, with the latter predicting much broader mass distribu- 
tion functions. None of these disagreements, however, come 
as a surprise. After all, as shown in the lower left panel, the 
K99 model predicts a LF that is in very poor agreement 
with the observed LF of the 2dFGRS, and thus also with 
the LFs corresponding to our CLF models. 

Figure 13 shows a similar comparison but now with the 
SA model of M02. Since the numerical simulations used by 
M02 have somewhat higher numerical resolution, we can ex- 
tend our comparison down to Mb, — 51ogft = —18.5. Mathis 
et al. (2002) focussed on modeling the local galaxy pop- 
ulation, using numerical simulations constrained to repro- 
duce the local smoothed linear density field out to a dis- 
tance of 8000 kms -1 . Since this corresponds to a cosmo- 
logical volume that is smaller than that in K99, the error- 
bars are somewhat larger. As shown in the lower left panel, 
the M02 model significantly improves the agreement with 
the observed LF at the bright-end compared to K99, even 
though it still severely underpredicts the number of galaxies 



with Mb j — 51og/i >, — 20. This explains why the SA model 
predicts (i) smaller occupation numbers, (ii) higher mass-to- 
light ratios, and (iii) a higher mean halo mass for galaxies 
with Mbj — 51ogh < —18.5 compared to our CLF models. 

Finally, in Figure 14 we compare our models to the SA 
model of B02. Since numerical resolution does not play a 
role here, we extend the comparison down to galaxies with 
Mb , — 51og/i = —16.5. As shown in the lower left panel, the 
B02 model yields a much better fit to the observed LF than 
either the K99 or M02 models. Since B02 chose to tune their 
model parameters to fit this particular LF, rather than the 
zero-point of the TF relation, this does not come as a sur- 
prise. The good agreement with our CLF models, however, is 
not a trivial result. Although some amount of disagreement 
remains, this can all be attributed to the fact that the B02 
model slightly over(under)-predicts the LF at luminosities 
just above (below) L* . All in all, the agreement is extremely 
encouraging and seems to suggest that our CLFs correspond 
to a galaxy formation scenario that is very similar to the SA 
model of B02. 

In summary, our CLF models imply halo occupation 
numbers and mass-to-light ratios that are in poor agree- 
ment with the semi-analytical models of K99 and M02, but 
in surprisingly good agreement with the model of B02. How- 
ever, this does not mean that the B02 models are better 
or more reliable than the models of K99 and M02. Rather, 
the differences between the various SA models reflect a well 
known problem in galaxy formation theory, namely the in- 
ability of models to simultaneously fit the observed LF and 
zero-point of the TF relation (e.g., Kauffmann et al. 1993; 
Cole et al. 1994, 2000; Somerville & Primack 1999; Benson 
et al. 2000). In fact, this does not seem a problem related to 
the particular prescriptions for star formation and/or feed- 
back used in these models, but to reflect a more fundamental 
problem related to the particular cosmological model: as wc 
have shown in detail in Paper 1, even the CLF models, which 
are independent of how galaxies form, are inconsistent with 
the TF zero-point. Since both the SA model of B02 and 
our CLF models are tuned to fit the LF, one should not 
expect good agreement regarding halo occupation numbers 
with the K99 and M02 models, which are tuned to fit the TF 
zero-point. However, it is extremely encouraging that the SA 
model that does fit the observed LF agrees remarkably well 
with our CLF models. The fact that these two wildly dif- 
ferent techniques imply consistent mass-to-light ratios and 
halo occupation numbers, is an important step forward in 
our quest for a coherent picture of galaxy formation within 
a CDM cosmology. The question whether the occupation 
distributions inferred from our CLFs can be made consis- 
tent with the standard framework for galaxy formation thus 
seems to have a positive answer. 



9 CONCLUSIONS 

Data from large ongoing redshift surveys such as the SDSS 
and 2dFGRS provide a wealth of information on the cluster- 
ing properties of galaxies as function of morphological type, 
luminosity, surface brightness, star formation rate, etc. At 
the same time, numerical simulations and sophisticated an- 
alytical techniques are continuously improving our under- 
standing of the clustering properties of dark matter haloes 
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within CDM cosmologies. The ultimate challenge is to con- 
strain how galaxies with different properties occupy haloes of 
different masses. This information not only gives an easy to 
interpret description of the galaxy-dark matter connection, 
but also yields important constraints on galaxy formation 
theories. 

In this paper we have used recent observations from 
the 2dFGRS to constrain the CLFs of early- and late-type 
galaxies. Although some amount of degeneracy remains, the 
CLFs are well constrained. They indicate that the average 
mass-to-light ratios of dark matter haloes have a minimum 
of ~ 100ft (M/L) around a halo mass of ~ 3 x lO 11 ^ 1 M . 
Towards lower masses (M/L) increases rapidly, and match- 
ing the faint-end slope of the observed LF requires that 
haloes with M < lO lo ft _1 M are virtually devoid of galaxies 
producing light. At the high mass end, the observed cluster- 
ing properties of galaxies require that clusters have mass- 
to-light ratios (in the photometric bj band) in the range of 
roughly 5OOft(M/L) to lOOOft(M/L) . Finally, the fact that 
early-type galaxies are more strongly clustered than late- 
type galaxies requires that the fraction of late-type galaxies 
is a strongly declining function of halo mass, although the 
exact relation is currently poorly constrained. Forthcoming 
data from both the SDSS and the 2dFGRS will reduce the 
errors on the current measurements and provide correlation 
lengths down to fainter magnitudes, both of which are es- 
sential to further constrain the CLFs. 

The halo occupation numbers (N(M)} implied by these 
CLFs depend rather strongly on the magnitude interval of 
galaxies considered. Typically, including fainter galaxies re- 
sults in weaker mass dependencies and a stronger departure 
from a single power-law form. This is mainly a reflection of 
the way in which early- and late-type galaxies contribute: 
early-type galaxies reveal a steeper (N(M)} than the late- 
type galaxies. When only bright galaxies are considered, 
(N(M)) is well fit by a single power-law (with a low mass 
cut-off) and is dominated by early-type galaxies. When suf- 
ficiently faint galaxies are included, the late-type galaxies 
start to dominate at the low mass end. Since these reveal 
a shallower (N(M)} than early-type galaxies, the shape of 
(N(M)) becomes more complicated. We have compared our 
halo occupation numbers with those obtained by Peacock 
& Smith (2000) from the multiplicity of galaxy groups. Ex- 
cept for haloes with (N) < 3, where the results of Peacock 
& Smith are uncertain due to unreliable extrapolation, the 
agreement is remarkably good. 

Once assumptions are made about the second moment 
of the halo occupation number distributions and the spatial 
distribution of galaxies within individual haloes, the two- 
point galaxy-galaxy correlation function can be computed 
from the CLFs presented here for any magnitude bin and 
for both early- and late-type galaxies (as well as their cross 
correlation). We presented a number of correlation functions 
under the assumptions that (i) P(N\M) is sub-Poissonian, 
(ii) the brightest galaxy always resides at the halo center, 
and (iii) the remaining satellite galaxies follow the density 
distribution of the dark matter. In order to facilitate a com- 
parison with real data we also computed the projected cor- 
relation functions. For late-type galaxies, we find that the 
power-law slopes of these projected correlation functions are 
virtually independent of luminosity. For early-type galaxies, 
on the other hand, a fairly strong luminosity dependence is 



predicted; early-type galaxies with luminosities around L* 
are less strongly clustered, and with a shallower power-law 
slope, than early type galaxies with either L > L* or L < L* . 
A direct comparison with the slopes and zero-points of the 
projected correlation functions obtained from the 2dFGRS 
reveals good agreement. We also presented predictions for 
the cross correlation functions with bright early-type galax- 
ies. Typically, these are stronger and steeper than the corre- 
sponding auto-correlation functions, which is a direct con- 
sequence of the fact that bright early-types reside predomi- 
nantly in massive clusters. 

Finally we have compared halo occupation numbers and 
mass-to- light ratios inferred from our CLFs with predictions 
from three different semi-analytical models for galaxy forma- 
tion. Overall, the agreement between our models and the SA 
models of both K99 and M02 is extremely poor. However, 
the reason for this discrepancy is well understood. Both K99 
and M02 have normalized their models to fit the zero-point 
of the TF relation. It is well known, that current models 
for galaxy formation fail to simultaneously fit the TF zero- 
point and the observed LF of galaxies. In fact, exactly the 
same problem was identified in Paper 1 based on modeling 
the CLF. Indeed, the models of K99 and M02 yield LFs that 
differ strongly from that of the 2dFGRS, and thus from those 
of our CLF models. It should not come as a surprise that 
therefore the halo occupation numbers and mass-to-light ra- 
tios do not match. Benson et al. (2002), however, presented a 
SA model that was normalized to fit the observed 2dFGRS 
LF. The agreement of this particular SA model with our 
CLF models is remarkably good. Given the completely dif- 
ferent nature of both techniques to predict halo occupation 
numbers and mass-to-light ratios, this agreement is far from 
trivial. In fact, it indicates that the technique used here has 
recovered a statistical description of how galaxies populate 
dark matter haloes which is not only in perfect agreement 
with the data, but which in addition fits nicely within the 
standard framework for galaxy formation. 
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APPENDIX A: THE TWO-POINT 
CORRELATION FUNCTION 

The two-point correlation function of the evolved, non-linear 
dark matter mass distribution is given by 

, . f°° dk . o , , . sin kr , . _ . 

Cdm(r) = X T^ {k) n^r (A1) 

Here 

A 2 Nh (k) = ^ k 3 P NL (k) (A2) 

is the dimensionless form of the evolved, non-linear power- 
spectrum PNL(fc). Throughout this paper we use the fitting 
function for A^ L (k) given by Smith et al. (2002) , and com- 
pute (dm(r) using equation (Al). 

For what follows, we split this correlation function in 
two parts: 

U(r) = {£,(r)+d h n (r). (A3) 

Here £dm corresponds to pairs within the same halo (the 
"1-halo" term), while £j m describes the correlation between 
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dark matter particles that occupy different haloes (the "2- 
halo" term). The 1-halo term is given by 



dkfr 



sin kr 
kr 



where 
S(M;k) 



-r 

Jo 



P 



dMn(M) [S{M;k)] 2 (A4) 



(A5) 



is the Fourier transform of the halo density profile p(r), trun- 
cated at the virial radius r v (e.g., Neyman & Scott 1952; 
McClelland & Silk 1977; Seljak 2000; Ma & Fry 2000; Scoc- 
cimarro ct al. 2001). We assume that p(r) has the NFW 
form 



p{r) 



8p 



(r/r s )(l + r/r s y 



(A6) 



where r s is a characteristic radius, p is the average density of 
the Universe, and 5 is a dimensionless amplitude which can 
be expressed in terms of the halo concentration parameter 

C — T"u j V s cLS 



180 (f_ 

3 ln(l + c) - c/(l + c) 



(A7) 



(cf., Navarro, Frenk & White 1997). Numerical simulations 
show that c is correlated with halo mass, and we use the re- 
lation given by Eke, Navarro & Steinmetz (2001), converted 
to the c appropriate for our definition of halo mass. Substi- 
tution of (A6) in (A5) yields, after some algebra, 



5(M; k) = AirSri 



C+S- 



sin(fcr„) 



kv s ~~\~ kv v 
(Scoccimarro et al. 2001). Here 
C = cos(fcr s )[Ci(fcr s + kr v ) — Ci(fcr s )] 
S = sin(fcr s )[Si(fcr s + kr v ) — Si(fcr s )] 



(A8) 

(A9) 
(A10) 
/ "dt sin(t)/t 



with Ci(a:) = — j°° dt cos(t)/t and Si(x) 
the cosine and sine integrals, respectively. 

In order to compute the galaxy-galaxy two-point cor- 
relation function, £gg( r ) we use the same 1-halo and 2-halo 
split as before: 



£gg(0 



4V) + 4V)- 



(All) 



For the 1-halo term we need to specify the distribution of 
galaxies in individual haloes, while the 2-halo term is given 
by the correlation of the population of dark matter haloes. 

Consider all galaxies with luminosities in the range 
[Li,L 2 ], and let £gg( r ) correspond to the two-point corre- 
lation function of this subset of galaxies. By definition, the 
total number of pairs of galaxies per comoving volume in this 
luminosity range that have separations in the range r±dr/2 
is 

n pair = ^[1-KggMlWW. (A12) 

where the factor 1/2 corrects for double counting of each 
pair. Here n g is the mean number density of galaxies with 
L G [Li, L2] which is given by 



Jo 



n(M) (N(M)) dM , 



(A13) 



and (N(M)}, given by equation (10), gives the mean number 
of galaxies in the specified luminosity range for haloes of 
mass M. 

Consider a galaxy with L € [L\,L2\ that lives in a 
halo of mass Mi. There are, on average, n(M2) 4ivr 2 dr dark 
matter haloes with mass M2 that are separated from this 
galaxy by a distance in the range r ± dr/2. In each of these 
haloes there are on average (N(M2)) galaxies that also have 
a luminosity in the range [Li,L2]. Integrating over the halo 
mass function, and taking account of the clustering prop- 
erties of dark matter haloes, we obtain the excess number 
N + of galaxy pairs per galaxy in a halo of mass Mi with 
separations in the range r ± dr/2: 

AT+(Mi) = Anr 2 dr 



/>oo 

/ n 

Jo 



(M 2 ) (N(M 2 )) Chh(r; M 1 ,M 2 ) dM 2 . 



(A14) 



Here £hh(r; Mi, M2) is the cross correlation between dark 
matter haloes of mass Mi and M2. Note that we implicitly 
assume that all galaxies in a given halo are located at the 
halo center. Since the separation between individual haloes 
is typically much larger than the separation between galaxies 
in the same halo, this simplification does not influence the 
results. 

Multiplying 7V + (Mi) with (N(Mi)} and integrating 
over the halo mass function gives the total excess number 
of galaxy pairs per comoving volume with separations in the 
required range. Combining this with equation (A12) yields 

ng£ggM4vrr 2 dr = / n(Mi)(iV(Mi)} iV+(Mi) dMi.(A15) 
Jo 

What remains is to specify the dark halo correlation 
function £ hh (r; Mi, M 2 ). Mo & White (1996) developed a 
model, based on the Press-Schechter formalism, that de- 
scribes the bias of dark matter haloes of mass M with re- 
spect to the dark matter mass distribution. According to 
this, 

U(r;M 1 ,M 2 )=b(M 1 )b(M 2 )ZZ(r), (A16) 

We use the functional form of b(M) suggested by Sheth, Mo 
& Tormen (2001), which takes account of ellipsoidal collapse, 
and which has been shown to accurately match the correla- 
tion function of dark matter haloes in iV-body simulations 
(Jing 1998; Sheth & Tormen 1999): 

1 



b(M) = 1 + 



y/a8 c (z) 

K 2 r 



\fa(av ) + \fab \av ) : c — 



(«/»)<= + 6 (l-c)(l-c/2) J' (A17) 

with a = 0.707, b = 0.5, c = 0.6 and v = S c /a(M). This bias 
is defined for haloes at 2 = 0. In Appendix B we describe 
how to compute the bias of haloes and galaxies at z > 0. 
Substituting equations (A16) and (A14) in (A15) yields 

4( r ) =*> £dm(r), 
with 



(A18) 



b = 



n{M) (N(M))b(M)dM . 



(A19) 



Equations (A18) and (A19) specify the 2-halo term of £ gg (r). 
Using a similar analysis as above, one obtains the 1-halo 
term of the galaxy correlation function as 
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— 2 poo 

Y Cgg (r)4vrr 2 dr = dr J n(M)(JV pair (M))/(r) dM, (A20) 

where (N paiT (M)) is the mean number of pairs in haloes 
of mass M, and f(r) gives the corresponding distribution 
of pair separations (e.g., Berlind & Weinberg 2002). Neither 
(iVp a i r (M)) nor f(r) can be obtained from the CLF. In order 
to compute £g g (r), additional assumptions have therefore 
to be made (see Section 6). The 1-halo term, on the other 
hand, depends only on (N(M)} and is therefore completely 
specified by the CLF. 



APPENDIX B: THE EFFECTIVE BIAS OF 
GALAXIES AT NON-ZERO REDSHIFT 

Dark matter haloes at redshift z are biased with respect to 
the dark matter mass distribution at z according to 

Z hh (r,z;M l ,M 2 )=b(M 1 ,z)b(M 2 ,z)tZ(r,z) (Bl) 

For this same population of haloes one can define a second 
bias according to 

C hh (r, 0; Mi, M 2 ) = b (M u z) b (M 2 ,z) ^(r, 0) (B2) 

where bo(M, z) describes the bias at z = of the descendants 
of haloes that have mass M at redshift z (Fry 1996; Mo &: 
White 1996, 2002). bo(M, z) can be expressed in terms of 
the bias b(M) at z = given by equation (A17) as 

bo(M, Z ) = l + ^[b(M)-l] (B3) 

(which is accurate to leading order), with D(z) the linear 
growth rate at redshift z. On sufficiently large scales, which 
are still in the linear regime, 

£hh(r,z;Mi,M 2 )=^j^ £ h h(r, 0; Mi, M 2 ) (B4) 
so that 

^ h (r,z;M 1 ,M 2 )=b eg (M 1 ,z)b eg {M 2 ,z)(,lUr,0) (B5) 
with 

MM, Z ) = ggj+&(M)-l (B6) 

Substituting b(M) in equation (A19) with b e s (M, z) one ob- 
tains the effective bias of galaxies at redshift z with lumi- 
nosities in the range L\ < L < L 2 : 



t£(r,z) y _D{z) 
e d h Jr,0) D(0) 



M^(iB^) =^ + b-l (B7) 



with 



the average bias of galaxies with Li < L < L2 at z = 0, 
which can be obtained from the CLF using equations (9) 
and (10). 
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